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Abstract: Fixational eye movements remain a major cause of artifacts in 
optical coherence tomography (OCT) images despite the increases in 
acquisition speeds. One approach to eliminate the eye motion is to stabilize 
the ophthalmic imaging system in real-time. This paper describes and 
quantifies the performance of a tracking OCT system, which combines a 
phase-stabilized optical frequency domain imaging (OFDI) system and an 
eye tracking scanning laser ophthalmoscope (TSLO). We show that active 
eye tracking minimizes artifacts caused by eye drift and micro saccades. 
The remaining tracking lock failures caused by blinks and large saccades 
generate a trigger signal which signals the OCT system to rescan corrupted 
B-scans. Residual motion artifacts in the OCT B-scans are reduced to 0.32 
minutes of arc (-1.6 jum) in an in vivo human eye enabling acquisition of 
high quality images from the optic nerve head and lamina cribrosa pore 
structure. 
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1. Introduction 

Optical coherence tomography (OCT) is an interferometric imaging modality that allows 
cross-sectional imaging of internal biological structures [1]. Since its invention, OCT has 
become a standard tool in ophthalmology to diagnose and study various eye diseases [2]. The 
noninvasive nature and high axial (3-5 urn) and lateral (5-30 jum) resolution are ideal to 
visualize the underlying structures and detect changes in the retinal tissue of healthy and 
diseased eyes. The emergence of spectral domain OCT [3,4], with a 100 to 1000 fold 
improvement in sensitivity [5-7] and the associated increase in scan speed allowing video rate 
imaging [8,9] has made a significant impact in the field. 

However, during image acquisition the eye is constantly moving. These involuntary 
fixational eye movements (e.g., micro saccades and drift [10]) remain a major cause of 
artifacts in OCT images even in ultrahigh-speed systems [11,12]. Post-processing algorithms 
have been developed to remove eye motion [13,14], but large and rapid eye movement can 
cause gaps in volume imaging which cannot be corrected with post-processing. Further, eye 
motion causes distortions and complicates the averaging of multiple B-scans in order to 
enhance the signal-to-noise ratio (SNR). 

Another approach to eliminate eye motion artifacts from the data is to monitor eye motion 
and correct the ophthalmic imaging system in real-time. Several methods have been 
developed over the years to track and quantify eye motion. The earliest methods measured 
motion of the anterior segment of the eye using magnetic search coils [15], movement of the 
reflection from the anterior optics [16,17] or were based on reflections from tight fitted 
contact lenses with tiny mirrors [18]. With the exception of the contact lens approach, these 
methods are still commonly used in many settings, but they are limited in the sense that the 
exact movement of the retina, and the associated retinal image, is never recorded directly. 

The first retinal image-based tracker measured the lateral motion of a blood vessel with a 
simple line-scan camera [19]. This was a precursor to the scanning laser ophthalmoscope 
(SLO), invented by Webb et al. [20,21] which offered a completely new tool in eye imaging. 
The confocality of SLO allowed the capture of high quality en face videos of the retina. The 
advantages of SLO for eye tracking were appreciated early on and systems were soon 
described for tracking retinal motion by the frame rate [22] and analyzing distortions within 
sections of individual frames [23,24]. The latter concepts and technologies set the stage for 
the correction of motion artifacts that we report in this paper. 

To overcome the problems that fixational eye movements cause in OCT imaging, several 
commercial systems have implemented eye tracking into their existing OCT devices e.g., 
Spectralis® OCT (Heidelberg Engineering, Germany), RTVue (Optovue Inc., USA) and 
tracking OCT from Physical Sciences Inc. (PSI) [25,26]. The Spectralis and RTVue systems 
use image-based tracking in which eye motions are extracted from video frames and the OCT 
beam is steered to keep it on target. In the Spectralis® system, the retinal image (1,000 points) 
from the infra-red SLO imaging channel is used to track motion. In the RTVue system the 
motion is measured at 30 Hz from an infrared full-field fundus camera. The PSI system uses a 
non-imaging method where a dithering beam is used to lock on reflectance changes from 
single retinal features within the optic nerve head (ONH) or on blood vessels [27]. The eye 
motion signal is used to control the OCT scanning mirrors to maintain the OCT scanning grid 
on its retinal target, except in the PSI system, where the beam is controlled with additional 
tracking mirrors. 

In this paper we present our existing optical frequency domain imaging (OFDI) system 
[28] with implemented experimental tracking SLO (TSLO) [29]. The TSLO images the eye 
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with a frame rate of 30 Hz but extracts eye motion at much higher rates by analyzing 
distortions within sections of each captured frame. These extracted eye motion signals are 
transformed into tracking signals that are combined with the signals that drive the OCT galvo 
mirrors. We present a careful performance analysis emphasizing tracking and stabilization 
accuracy of the TSLO. Further, we present images of the retina and the ONH that are free 
from micro saccades and drift. Finally, we demonstrate how large and rapid saccades or blinks 
that lead to untracked motion can be corrected by implementing a validity signal to the OFDI- 
TSLO system that enables rescanning of the retinal areas where tracking was lost. 

2. Experimental system 

2.1. Optical setup 

A detailed description of the OFDI system can be found in Braaf et al. [28] and optical layout 
can be seen in Fig. 1. It uses a 100 kHz swept-source (Axsun Technologies, USA) with a 
center wavelength of 1050 nm and an axial resolution of 4.8 um (6.5 um in air). For 
calibration and phase- stabilization a parallel, externally built Mach-Zehnder interferometer 
(illustrated as the MZI block in Fig. 1) is used to align A-lines to the same wave numbers (k- 
space) in post-processing. The beam diameter on the cornea was measured to be 1.62 mm 
(lie 2 ) with a knife edge method [30,31]. The corresponding theoretical diffraction-limited spot 
size on the retina was calculated to be 13.7 um. 

Technical details of the TSLO are described in Sheehy et al. [29] and the optical 
schematic is shown in Fig. 1. In summary, it uses a super luminescent diode (SLD-371, 
Superlum, Russia) with a center wavelength of 840 nm as the light source. The TSLO images 
the eye with 512 x 512 pixel frames acquired at 30 Hz over a field size of 4 degrees. A 
16 kHz resonant scanner (Electro-Optical Products Corporation) is used for fast-axis 
(horizontal) scanning and slow- axis (vertical) scanning is performed by a galvo scanner at 30 
Hz (Cambridge Technologies). The reflected light from the retina was detected with a 
photomultiplier tube (PMT). To provide tracking signals in real-time, the TSLO image 

Sheehy et al. Biomed. Opt. Express 2012 



Braaf et al. Opt. Express 201 1 




Fig. 1. A layout of the optical setup. SLD: Super luminescent diode; VS: Vertical scanner; HS: 
Horizontal Scanner; DM: Dichroic Mirror; PMT: Photomultiplier tube; MZI: external Mach- 
Zehnder Interferometer. The numbers are presenting the splitting ratios of the fiber couplers. 
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acquisition and processing was custom designed and uses the combined functionality of a 
GPU (GeForce GTX 560) and a Field-Programmable Gate Array (FPGA) board (Virtex 5, 
Xilinx). 

The OFDI and TSLO systems were combined via a dichroic mirror and the layout of the 
optical setup is shown in Fig. 1 . The optical power on the cornea was measured to be 1 .48 
mW (1050 nm) and 460 uW (840 nm), for which the combination was calculated to be within 
the ANSI standards for laser safety. The OCT sensitivity was measured to be 99.5 dB, which 
is 7.9 dB lower than the computed theoretical sensitivity of 107.4 dB. This decrease in 
sensitivity is due to optical losses in the sample arm and interferometer. 

A schematic showing how the information flows is presented in Fig. 2. In short, the TSLO 
images the retina at 30 frames/s and the motion from the SLO frames is extracted taking 
advantage of the GPU and FPGA to deliver a high quality tracking signal with minimal 
latency. The tracking signal is the inverse of the eye motion scaled to fit the OCT galvo 
scanner voltage range. An electronic summing junction adds the tracking signal to the OCT 
galvo scanner steering signals (marked as + in Fig. 2). This voltage scaling is explained in 
section 3.2. 

Eye motion extraction 

TSLO-PC 



Signals to control SLO 



SLO images 30 frames/s 



GPU 



FPGA 



Gain 



Tracking validity signal 



SLO 



840 nm 



Dichroic 
mirror 




1050 nm 



Tracking signal 



OCT * — (+> 



Signals to galvo mirrors 



I 


OCT- 


Motion corrected OCT 




PC 


image signal 







Fig. 2. An overview of the information flow. The TSLO images the retina at 30 frames/s and 
the eye motion is extracted in the TSLO-PC using the FPGA and GPU. The inverse eye motion 
signals are then scaled to match the voltage range in the OCT system (Gain). Tracking signals 
are combined with OCT beam steering signals in the electronic summing junction (+) to 
compensate for the eye motion in real-time. The tracking validity signal is used to indicate B- 
scans that need to be rescanned because tracking failures occurred due to large eye motions or 
blinks. 

2.2. Image stabilization 

The software that was used to extract the eye motion from TSLO images is similar to that 
described by Yang et al. [32] and Stevenson et al. [33] and is briefly explained here. First, a 
TSLO reference frame is selected. Then each consecutive frame is broken down into small 
strips which are cross-correlated one by one with the reference frame. The displacements of 
the strips in the horizontal and vertical directions provide the eye motion, which is inverted 
and used as correction signal for the OCT system. In the current setup, each frame is split into 
32 overlapping strips, each 32 pixels high and 512 pixels wide, which yield an eye motion 
reporting rate of 960 Hz. 

In the case of tracking failures, the TSLO-PC will send an invalid signal to the OCT-PC. 
Tracking failures are identified by sub threshold correlations of the current frame with the 
reference frame. Low correlations are caused by (i) large drifts, which limit the amount of 



#175077 - $15.00 USD Received 29 Aug 2012; revised 10 Oct 2012; accepted 10 Oct 2012; pub. 24 Oct 2012 

(C) 2012 OSA 1 November 2012 / Vol. 3, No. 1 1 / BIOMEDICAL OPTICS EXPRESS 2954 



overlap between strips in the current frame and the reference frame, (ii) large saccades, which 
cause excessive shearing distortions of strips in the current frame, (iii) vertical motion, which 
leads to loss of overlap between strips in the current frame with the reference frame, (iv) 
blinks and (v) misalignments of the pupil, which cause reductions in image quality. Upon 
receiving the invalid signal, the OCT initiates rescanning of earlier B-scans, so that valid B- 
scans are acquired over the entire volume image. Specifically, after receiving the invalid 
signal, the OCT slow-axis scanner steps back 10 B-scans and waits for the validity signal to 
become valid again. When the validity signal indicates that the tracking is working again, the 
acquisition of B-scans is continued. B-scans that are corrupted by failed tracking are logged 
and then removed in post-processing. 

2.3. Ethical considerations 

The use of our experimental setup for in vivo measurements in humans was approved by the 
local Institutional Review Board and adhered to the tenets of the Declaration of Helsinki. 
Informed consent was obtained from each subject. 

3. Tracking performance analysis 

For reference, the amplitude spectrum of typical eye motion during fixation is presented in 
Fig. 3. This spectrum was generated using the same image-based algorithms to measure eye 
motion from SLO videos as described in section 2.2, but was done offline with a more robust 
and higher frequency analysis (64 strips per frame or 1920 Hz). The spectrum was computed 
from ten, 1 0-second videos recorded from an eye that was fixating. The spectrum shows that 
the frequencies of eye motion drop in a \lf manner similar to that described in the literature 
[34]. Beyond 10 Hz, the amplitude of eye motion is generally less than 0.5 minutes of arc, 
which corresponds to approximately 2.4 jum in a normal eye. This spectrum establishes the 
frequencies and magnitudes that the TSLO should be designed to correct. 




1 10 100 

Frequency (Hz) 



Fig. 3. The frequency spectrum of eye motion that was measured from a healthy volunteer 
(adapted from Sheehy et al. [29]). The spectrum shows similar behavior as reported in the 
literature [33-35]. 

3.1. Tracking OCT performance 

TSLO performance was quantified by testing it on a model eye which was designed to 
simulate eye motion. The model eye consisted of a lens (f = 25 mm), an artificial retina made 
from translucent tape layers to provide texture in both lateral and axial directions and a 
galvanometer mirror (6200H, Cambridge Technologies, USA) in between. In the model eye, 
lateral motion of the retina was simulated by scanning the galvo mirror with variable 
frequency and amplitude. The motion generated in the model eye was typically faster and 
larger than the human eye naturally produces to test the TSLO under extreme conditions. 

The motion of the model eye was sinusoidal and varied from 1 to 128 Hz. The amplitude 
was adjusted to be well above typical human eye motion but low enough to avoid tracking 
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failures caused by excessive image distortions. The motion of the model eye was calculated 
from SLO measurements with and without tracking to determine the motion correction 
performance. The percentage of correction was computed as 100% minus the output motion 
divided by input motion. More details on this step are described by Sheehy et al. [29]. 

In the OCT tracking application, there is an additional error caused by the time delay, or 
latency, between the motion measurement and applying the correction to the OCT scanners. 
The latency of the TSLO system was measured to be 2.5 ms, which was computed from 
measurements of the phase delay between the actual motion (derived from the position sensor) 
of the galvo scanner in the model eye and the correction signals sent from the TSLO to the 
OCT scanners. 

Figure 4 shows the final performance of the motion correction for the OCT as a function 
of frequency, which was computed by combining the TSLO frequency performance (Sheehy 
et al. [29]) with the latency. The 50% (3 dB) motion correction bandwidth is just above 32 Hz 
which is sufficient to compensate most eye motion in a fixating eye. 
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Fig. 4. The magnitude of motion correction of the TOCT as a function of frequency. The plot 
was generated from the eye motion measurement performance of the TSLO (Sheehy et al. [29]) 
combined with the additional error caused by the latency between the eye motion measurement 
and the output of correction signals to the OCT galvo scanners. 

The velocity threshold of the eye motion for 4° field of view is approximately 14°/s [29]. 
Since saccades can have higher velocities than 14°/s [10], tracking failures caused by fast and 
large saccades do occur in the data sets. These failures will likely occur more frequently in 
inexperienced subjects and patients due to more saccadic motion during fixation. However, 
tracking failures are accounted for by the previously mentioned validity signal. 

3.2. Scaling of the correction signals 

An adjustable gain factor was added to the TSLO control software to correct for differences 
between the eye motion correction voltage outputs and the voltage response of the OCT 
scanners. 

To determine the optimal gain value, we acquired a series of B-scans of the model eye 
with a large amplitude sinusoidal oscillation. With the tracking system on, we adjusted the 
gain until we found a minimum in the residual motion. To quantify the residual motion, the 
series of B-scans was cross-correlated to a reference frame (first acquired B-scan). Figure 5 
shows the minimum residual motion achieved in the system when the input motion amplitude 
is ±17.4 minutes of arc at 1 Hz. The residual motion never reduced to zero because the B-scan 
continued to rock (tilt) in the axial direction even after motion correction, owing to the change 
in path length caused by the way we generated eye motion in the model eye. In addition to the 
tilt effect, the internal latency contributed to the residual motion. The amplitude of the fitted 
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sine curve is 0.5 arcmin and the lowest standard deviation (SD) of residual motion was found 
to be 0.37 minutes of arc. To make model eye motion comparable with the real eye motion, 
we took SD since eye motion is not periodic. In Fig. 4, 1 Hz motion is corrected by 98.4% 
which means that the residual sinusoidal motion amplitude from 17.4 arcmin should be 0.28 
arcmin (1.6%). This value is in good agreement with the experimentally measured value. 




Time (s) 

Fig. 5. Residual motion that is present in the OCT data after applying the optimal gain setting 
for eye motion compensation. The standard deviation is 0.37 minutes of arc, which corresponds 
to approximately 2.7 um (as comparison the spot size on the human retina was calculated to be 
13.7 urn). 

4. Imaging 

4.1. Performance of the TOCT imaging in the model eye 

Before imaging human eyes, motion correction in tracking OCT was tested with the model 
eye. In Fig. 6, 250 B-scans were taken with and without tracking. The motion in the model 
eye was a 1 Hz sinusoidal wave that generated a ±12.4 arcmin movement of the artificial 
retina. Each acquired B-scan was integrated over depth and displayed in sequence to create 
what we term a B-scan trace for untracked (A) and tracked (B) imaging. Comparing the B- 
scan traces from Fig. 6 it is apparent that the motion is well compensated. Without tracking a 
clear 1 Hz sinusoidal motion is seen, whereas with tracking the features in the B-scan trace 
remain stationary, forming straight lines. This is quantified with cross-correlation to measure 
the motion within the series of B-scans. In Fig. 6(C) the horizontal motion is plotted as a 
function of time. Without tracking the SD of the motion is 8.27 arcmin and with tracking it 
reduces to 0.40 arcmin. This is consistent with the value obtained from the correction signal 
scaling experiment in section 3.2. The visible features in B-scan traces are from tape layers 
that were used to make the artificial retina. 

In Fig. 7 a square area of the model retina was imaged under three different conditions 
using the same model eye as previously described. An en face image from each condition is 
presented. The left image shows the model retina imaged with no motion in the retina and 
tracking switched off (Fig. 7(A)). Figure 7(B) shows the motion artifacts when motion is 
induced and Fig. 7(C) shows how tracking corrects the motion while motion was put into the 
system. When comparing Figs. 7(A) and 7(C), it is seen that the original structure is recovered 
when tracking is enabled. 
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(A) (B) (C) 




Fig. 6. B-scans taken from the model eye over the course of 5 seconds. (A) B-scans were taken 
without tracking and the features can be seen oscillating in the trace image. Below the image, 
all 250 frames were averaged which resulted in a blurry cross-sectional image of the tape layers 
in the artificial retina. (B) The same location was imaged with tracking. It is clearly seen that 
the previously seen motion is compensated. Again, below the image, all 250 frames were 
averaged together and this produced a cross-sectional image of the tape layers with clear 
structure. (C) The first frame from each data set was taken as reference frame and the 
consecutive frames were cross-correlated to the selected reference. In C the horizontal B-scan 
motion is plotted as a function of time. The red curve is derived from A (no tracking) and the 
blue curve from B (tracking). The standard deviation for the untracked curve is 8.3 arcmin and 
for tracked 0.4 arcmin. Scale bars indicate 0.5 deg. 

(A) (B) (C) 




No motion, Motion, Motion, 

tracking OFF tracking OFF tracking ON 

Fig. 7. En face (B-scans integrated over depth) images of the model retina consisting of layers 
of tape under different conditions. (A) No motion is present and tracking signals are not 
generated. (B) System is introduced to a horizontal 1 Hz sinusoidal motion with an amplitude 
of ±12.4 arcmin, tracking is off. (C) Motion is the same as in B but tracking signals are 
generated and combined with OCT beam steering signals to compensate eye motion. In C it is 
seen that the original structure of the moving retina is recovered. Scale bars indicate 0.5 deg. 

4.2. Performance of the TOCT imaging in a real eye 

To measure the tracking stability in a human eye, 250 B-scans consisting of 2000 A-lines 
were acquired from the same location in a healthy volunteer. The SLO video recording was 
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triggered by the OCT data acquisition to synchronize data collection between the two imaging 
systems. The simultaneously acquired TSLO video was analyzed to determine the actual eye 
motion that took place during the acquisition of the OCT data set. The OCT measurement 
with this protocol took approximately 5 seconds. B-scan traces were generated with and 
without tracking (Fig. 8) in the same manner as for the model eye. In Fig. 8(A) the features in 
B-scan trace image (en face image) are changing as the eye moves. Lateral shifts of the B- 
scan trace match well with the TSLO eye motion trace that is located on the left side of the 
figure (blue trace), whereas vertical motion of the eye (green trace) causes the intensity profile 
of the B-scan to change over time. The OCT B-scan images were also used to estimate the 
lateral eye motion. Cross-correlation of the first B-scan image with subsequent B-scan images 
was used to calculate the lateral and axial displacement, where the axial motion was ignored. 
TSLO and OCT B-scan image motion analysis (shown on the left and right side of the B-scan 
trace, respectively) are in a good agreement. The standard deviation of the eye trace data is 4.7 
arcmin for horizontal and 1.9 arcmin for vertical motion (vector sum being 5.1 arcmin). In 

(A) 




-20 0 20 -20 0 20 

Motion (arcmin) Motion (arcmin) 



Fig. 8. B-scan trace image of 250 B-scans consisting of 2000 A-lines/B-scan taken without (A) 
and with (B) tracking. (A) The left graph shows the eye traces extracted from TSLO videos 
where blue is the horizontal motion and green is vertical. The B-scan trace image correlates 
with the blue curve (horizontal motion). On the right, a cross-correlation graph of the OCT data 
is plotted, which matches well with the TSLO horizontal eye trace. In the OCT cross- 
correlation plot, the first B-scan of the data set was taken as a reference frame and consecutive 
frames were cross-correlated against it. Only lateral motion was calculated, axial motion was 
ignored. (B) Same location as the area in (A) but with tracking. A large saccade is present that 
is seen as a peak in the cross-correlation curve and as a clear shift in the B-scan trace image. 
Scale bars indicate 0.5 deg. 
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Fig. 8(B) the same area is imaged as in Fig. 8(A) but with tracking enabled. The motion of the 
eye was similar to the previous measurement (6.8 arcmin horizontal, 2.6 arcmin vertical and 
vector sum 7.3 arcmin), but the standard deviation from the eye motion compensated cross- 
correlation data is reduced from 7.3 arcmin to 0.8 arcmin. This value is higher than shown in 
the model eye data (0.40 arcmin) due, in part, to a large (12 arcmin) untracked horizontal 
saccade that appeared in the data. If this saccade is removed from the data, the reduced motion 
gets a value of 0.32 arcmin. 

The same subject was used to test the benefits of tracking on the acquisition of OCT 
volume images. To visualize the benefit, each acquired B-scan in a volume data set was 
integrated over depth to create an en face image. Actual eye motion traces were extracted 
from the simultaneously acquired TSLO video for each OCT volume. In Fig. 9 four different 
en face images are shown. Figures 9(A) and 9(B) are taken from an area of 3.11 x 3.11 mm 



(A) no tracking (B) tracking 




-20 0 20 -20 0 20 

Motion (arcmin) Motion (arcmin) 




(C) no tracking (D) tracking 




-20 0 20 -20 0 20 

Motion (arcmin) Motion (arcmin) 



Fig. 9. En face images from 4 different volume data sets. (A) Large field of view (10.6° or 3.1 1 
x 3.1 1 mm) was imaged without tracking enabled. On the left of the image A the corresponding 
eye traces are plotted. Below the image, three different areas are shown as zoomed versions 
from the large image. (B) Same area imaged as in A but tracking was enabled. Enlarged areas 
show that the motion artifacts are compensated. (C) The smaller field of view (5.3° or 1.56 x 
1.56 mm) clearly demonstrates eye motion artifacts. One large saccade causes the scanning 
grid to acquire data from different position. (D) Same area imaged as in C but with tracking 
enabled. In this data set there is a large saccade which is tracked well. However, the final image 
still shows artifacts from brief tracking failures. To fully compensate large eye motions, a 
validity signal needs to be used to rescan the areas that are affected by improper tracking (see 
Fig. 10). Scale bars indicate 0.5 deg. 
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(10.6°) with a scanning protocol of 1000 B-scans, each consisting of 1000 A-lines. On the left 
side of each image the corresponding eye traces (blue = horizontal, green = vertical) are 
plotted. Three areas are zoomed to show how the motion distorts the vessel patterns and how 
with tracking these vessel patterns are undistorted. In Figs. 9(C) and 9(D) smaller areas were 
imaged with a field size of 1.56 x 1.56 mm (5.3°). Both images show a large saccade and with 
the tracked version the large eye motion is clearly compensated. Small distortions are still 
visible in the tracking image due to tracking failures. These distortions demonstrate the need 
to implement the validity signal to signal the OCT system to rescan these areas and to remove 
these distortions from the image. 

As said, image distortions from large saccades and data gaps from blinks remain in the 
data set even when tracking is used. For these reasons, tracked volume scans with the validity 
signal (see section 2.2) were taken (700 B-scans, each consisting of 700 A-lines). In Fig. 10 
two areas are shown of 2.2 x 2.2 mm (6.9°) and 2.7 x 2.7 mm (8.8°) from retinal vessels and 
the ONH respectively. The upper images show the whole data set, including the B-scans that 
are logged as corrupted. These corrupt B-scans are removed in the bottom images to generate 
a new en face image. The benefits of tracking and validity signal are clearly seen in these 
images. 



Raw data with corrupted B-scans 




Corrupted B-scans removed 




Fig. 10. Tracked en face images with validity signal. When tracking software lost tracking due 
to a large saccade or blink, OCT-PC was signaled to step back 10 B-scans and hold that 
position until tracking was locked again on target. B-scans collected during tracking failure are 
removed in post-processing and replaced with rescanned counterparts. {Top images) En face of 
all acquired B-scans is shown. This includes B-scans acquired during large saccades or blinks. 
The blink can be seen as a black line in the upper left image. {Bottom images) Motion or blink 
corrupted B-scans are removed from the volume data set. The black line caused by the blink is 
gone and several large saccades are also removed. Scale bars indicate 1 deg. 
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In addition, a fly-through video of the ONH along the axial direction was made. To reduce 
speckle noise, 4 volume data sets from the same location were compounded together with 
image registration and then averaged to produce a high quality movie from the ONH. From 
this movie, C-scans from 4 different depths are shown in Fig. 1 1 . The depths of the C-scans 
are indicated in a B-scan with colored lines. The fly-through video can be downloaded from 
the caption in Fig. 1 1 . 

The high SNR, the small roll-off (6 dB at 4.2 mm depth [28]) and the use of 1050 nm 
wavelength provide a deep penetration into the tissue and the mesh-like structure of the 
lamina cribrosa is clearly resolvable in all depth slices. With tracking, we were able to 
visualize the porous structure of lamina cribrosa at larger depths, approximately 430 jum 
below the bottom of the ONH cup (indicated by a white dashed line in Fig. 10). In addition to 
accurate volume averaging, tracking minimizes the motion artifacts effectively and this is seen 
as a high quality in the obtained C-scans. The features seen in the C-scans are consistent with 
the known anatomy of the optic disc [36,37] 




Fig. 11. C-scans extracted from different depths from the ONH movie (Low resolution: 
Media 1 (3.9 MB), High resolution: Media 2 (17 MB)). Four different data sets were 
compounded together via image registration to enhance the SNR. The laminar cribrosa mesh- 
like structure is clearly seen in all selected depths and the pore size increases when moving 
further away from the center of the optic disc. On the left of the figure, a B-scan is shown to 
illustrate at which depth each slice (C-scan) was taken (white dashed line indicates the 
reference point). The porous structure of lamina cribrosa is still visible even at depth of 429 urn 
measured from the bottom of the ONH cup. 

4. Discussion 

To summarize the results, we were able to reduce the motion from 17.4 to 0.40 minutes of arc 
in a model eye and from 7.3 arcmin to 0.32 arcmin in a real eye in OCT images by using the 
TSLO. The real eye residual SD motion value is less than a third of the theoretical OCT spot 
diameter on the retina. Without using the validity signal, we were able to track drift and small 
saccades properly. After implementing the validity signal, locations with large saccades and 
blinks were rescanned. This allowed the gathering of data sets without gaps and minimal 
artifacts from tracking failures. 
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Tracking OCT can be advantageous for routine clinical use, but also for patients who have 
weakened fixation capabilities due to a disease, age or recent trauma in the eye. Tracking may 
be most useful for advanced OCT applications such as Doppler OCT [38] (for imaging the 
vasculature) and polarization-sensitive OCT [39] because it allows one to record or integrate 
motion-corrected data over long time periods. Since the scanning grid is locked on target, 
several volume data sets can be taken from the same area and averaged to reduce the speckle 
noise. Moreover, adaptive optics OCT can greatly benefit from tracking since it is known to 
be vulnerable to eye motion because of the small field size [40]. 

Several factors limit the performance of the system and are described here. 

Latency: The 2.5 ms latency between measurement and scanner correction has a 
significant impact on tracking performance. In fact, frequencies above 64 Hz eye motion 
errors would get amplified. To improve the tracking bandwidth to cover the range of typical 
saccades and prevent eye motion amplification at frequencies above 64 Hz, latency should be 
decreased. 

Vertical motion: If the eye moves too far vertically, there will be strips from the top or 
bottom of the current TSLO frame that do not overlap with the reference frame, resulting in a 
loss of valid tracking signal. In the current implementation, the TSLO system extrapolates the 
estimate of eye position during these periods from the last valid measure as well as generates 
an invalid tracking signal. 

Reference frame distortion: Every frame in the TSLO video is distorted from eye motion, 
including the reference frame. If the reference frame contains distortions from a large saccade, 
the stabilization will continuously fail. Small distortions in the reference frame often enable 
good stabilization but the distortions give rise to periodic motion artifacts in the eye trace at 
the frame rate (30 Hz) and subsequently to the OCT scanner. Reselecting the reference frame 
will often minimize this problem. 

Torsion: The TSLO does not measure rotation of the eye around its optical axis, or 
torsion. Torsional movements of the eye during tracking also give rise to periodic eye motion 
artifacts at the frame rate (30 Hz). Torsional eye motion in a fixating eye has a standard 
deviation of less than 0.25 degrees [41] and so has a relatively small effect on the tracking 
signal. 

Every listed factor limits the system in a different way. Vertical and torsional eye 
movements may or may not be a problem, depending on the subject. Reference frame 
distortions can be minimized to a negligible level with careful selection by a trained operator. 
The latency is the dominant source of tracking error and can only be corrected by 
improvements in software and hardware. 

5. Conclusion 

In conclusion, we have demonstrated the integration of an experimental SLO eye tracker to 
our existing OFDI system. 

The correction bandwidth of the TOCT system was measured to be 32 Hz. This bandwidth 
was enough to compensate the majority of the eye motion. In addition to the tracking feature, 
a validity signal was implemented to the TSLO system to indicate if the tracking lock fails. 
Locations with invalid signal were rescanned thus B -scans containing rapid saccades and 
blinks could be discarded from the raw data. The system generates OCT images that have 
minimal amount of artifacts from drifts, micro saccades and blinks. 
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